Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Text extraction from natural scene image: A survey

Identifieur interne : 000182 ( Main/Exploration ); précédent : 000181; suivant : 000183

Text extraction from natural scene image: A survey

Auteurs : HONGGANG ZHANG [République populaire de Chine] ; KAILI ZHAO [République populaire de Chine] ; Yi-Zhe Song [Royaume-Uni] ; JUN GUO [République populaire de Chine]

Source :

RBID : Pascal:13-0324769

Descripteurs français

English descriptors

Abstract

With the increasing popularity of portable camera devices and embedded visual processing, text extraction from natural scene images has become a key problem that is deemed to change our everyday lives via novel applications such as augmented reality. Text extraction from natural scene images algorithms is generally composed of the following three stages: (i) detection and localization, (ii) text enhancement and segmentation and (iii) optical character recognition (OCR). The problem is challenging in nature due to variations in the font size and color, text alignment, illumination change and reflections. This paper aims to classify and assess the latest algorithms. More specifically, we draw attention to studies on the first two steps in the extraction process, since OCR is a well-studied area where powerful algorithms already exist. This paper offers to the researchers a link to public image database for the algorithm assessment of text extraction from natural scene images.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Text extraction from natural scene image: A survey</title>
<author>
<name sortKey="Honggang Zhang" sort="Honggang Zhang" uniqKey="Honggang Zhang" last="Honggang Zhang">HONGGANG ZHANG</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>School of Communication and Information Engineering, Beijing University of Posts and Telecommunications</s1>
<s2>Beijing</s2>
<s3>CHN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>République populaire de Chine</country>
<placeName>
<settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Kaili Zhao" sort="Kaili Zhao" uniqKey="Kaili Zhao" last="Kaili Zhao">KAILI ZHAO</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>School of Communication and Information Engineering, Beijing University of Posts and Telecommunications</s1>
<s2>Beijing</s2>
<s3>CHN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>République populaire de Chine</country>
<placeName>
<settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Song, Yi Zhe" sort="Song, Yi Zhe" uniqKey="Song Y" first="Yi-Zhe" last="Song">Yi-Zhe Song</name>
<affiliation wicri:level="4">
<inist:fA14 i1="02">
<s1>School of Electronic Engineering and Computer Science, Queen Mary, University of London</s1>
<s2>London E1 4NS</s2>
<s3>GBR</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Royaume-Uni</country>
<placeName>
<settlement type="city">Londres</settlement>
<region type="country">Angleterre</region>
<region type="région" nuts="1">Grand Londres</region>
</placeName>
<orgName type="university">Université de Londres</orgName>
</affiliation>
</author>
<author>
<name sortKey="Jun Guo" sort="Jun Guo" uniqKey="Jun Guo" last="Jun Guo">JUN GUO</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>School of Communication and Information Engineering, Beijing University of Posts and Telecommunications</s1>
<s2>Beijing</s2>
<s3>CHN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>République populaire de Chine</country>
<placeName>
<settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">13-0324769</idno>
<date when="2013">2013</date>
<idno type="stanalyst">PASCAL 13-0324769 INIST</idno>
<idno type="RBID">Pascal:13-0324769</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000043</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000725</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000025</idno>
<idno type="wicri:doubleKey">0925-2312:2013:Honggang Zhang:text:extraction:from</idno>
<idno type="wicri:Area/Main/Merge">000185</idno>
<idno type="wicri:Area/Main/Curation">000182</idno>
<idno type="wicri:Area/Main/Exploration">000182</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Text extraction from natural scene image: A survey</title>
<author>
<name sortKey="Honggang Zhang" sort="Honggang Zhang" uniqKey="Honggang Zhang" last="Honggang Zhang">HONGGANG ZHANG</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>School of Communication and Information Engineering, Beijing University of Posts and Telecommunications</s1>
<s2>Beijing</s2>
<s3>CHN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>République populaire de Chine</country>
<placeName>
<settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Kaili Zhao" sort="Kaili Zhao" uniqKey="Kaili Zhao" last="Kaili Zhao">KAILI ZHAO</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>School of Communication and Information Engineering, Beijing University of Posts and Telecommunications</s1>
<s2>Beijing</s2>
<s3>CHN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>République populaire de Chine</country>
<placeName>
<settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Song, Yi Zhe" sort="Song, Yi Zhe" uniqKey="Song Y" first="Yi-Zhe" last="Song">Yi-Zhe Song</name>
<affiliation wicri:level="4">
<inist:fA14 i1="02">
<s1>School of Electronic Engineering and Computer Science, Queen Mary, University of London</s1>
<s2>London E1 4NS</s2>
<s3>GBR</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Royaume-Uni</country>
<placeName>
<settlement type="city">Londres</settlement>
<region type="country">Angleterre</region>
<region type="région" nuts="1">Grand Londres</region>
</placeName>
<orgName type="university">Université de Londres</orgName>
</affiliation>
</author>
<author>
<name sortKey="Jun Guo" sort="Jun Guo" uniqKey="Jun Guo" last="Jun Guo">JUN GUO</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>School of Communication and Information Engineering, Beijing University of Posts and Telecommunications</s1>
<s2>Beijing</s2>
<s3>CHN</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>République populaire de Chine</country>
<placeName>
<settlement type="city">Pékin</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Neurocomputing : (Amsterdam)</title>
<title level="j" type="abbreviated">Neurocomputing : (Amst.)</title>
<idno type="ISSN">0925-2312</idno>
<imprint>
<date when="2013">2013</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Neurocomputing : (Amsterdam)</title>
<title level="j" type="abbreviated">Neurocomputing : (Amst.)</title>
<idno type="ISSN">0925-2312</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithmics</term>
<term>Alignment</term>
<term>Augmented reality</term>
<term>Boarded computer</term>
<term>Camera</term>
<term>Character recognition</term>
<term>Computer vision</term>
<term>Document image processing</term>
<term>Extraction process</term>
<term>Illumination</term>
<term>Image enhancement</term>
<term>Image processing</term>
<term>Information extraction</term>
<term>Information retrieval</term>
<term>Localization</term>
<term>Luminance</term>
<term>Natural scenes</term>
<term>Optical character recognition</term>
<term>Pattern extraction</term>
<term>Scene analysis</term>
<term>Segmentation</term>
<term>Text</term>
<term>User interface</term>
<term>Virtual reality</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Extraction information</term>
<term>Texte</term>
<term>Analyse scène</term>
<term>Calculateur embarqué</term>
<term>Traitement image</term>
<term>Réalité virtuelle</term>
<term>Interface utilisateur</term>
<term>Recherche information</term>
<term>Vision ordinateur</term>
<term>Localisation</term>
<term>Reconnaissance optique caractère</term>
<term>Reconnaissance caractère</term>
<term>Appareil photographique</term>
<term>Réalité augmentée</term>
<term>Alignement</term>
<term>Luminance</term>
<term>Procédé extraction</term>
<term>Extraction forme</term>
<term>Algorithmique</term>
<term>Segmentation</term>
<term>Eclairement</term>
<term>Scène naturelle</term>
<term>Accentuation image</term>
<term>Traitement image document</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Réalité virtuelle</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">With the increasing popularity of portable camera devices and embedded visual processing, text extraction from natural scene images has become a key problem that is deemed to change our everyday lives via novel applications such as augmented reality. Text extraction from natural scene images algorithms is generally composed of the following three stages: (i) detection and localization, (ii) text enhancement and segmentation and (iii) optical character recognition (OCR). The problem is challenging in nature due to variations in the font size and color, text alignment, illumination change and reflections. This paper aims to classify and assess the latest algorithms. More specifically, we draw attention to studies on the first two steps in the extraction process, since OCR is a well-studied area where powerful algorithms already exist. This paper offers to the researchers a link to public image database for the algorithm assessment of text extraction from natural scene images.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Royaume-Uni</li>
<li>République populaire de Chine</li>
</country>
<region>
<li>Angleterre</li>
<li>Grand Londres</li>
</region>
<settlement>
<li>Londres</li>
<li>Pékin</li>
</settlement>
<orgName>
<li>Université de Londres</li>
</orgName>
</list>
<tree>
<country name="République populaire de Chine">
<noRegion>
<name sortKey="Honggang Zhang" sort="Honggang Zhang" uniqKey="Honggang Zhang" last="Honggang Zhang">HONGGANG ZHANG</name>
</noRegion>
<name sortKey="Jun Guo" sort="Jun Guo" uniqKey="Jun Guo" last="Jun Guo">JUN GUO</name>
<name sortKey="Kaili Zhao" sort="Kaili Zhao" uniqKey="Kaili Zhao" last="Kaili Zhao">KAILI ZHAO</name>
</country>
<country name="Royaume-Uni">
<region name="Angleterre">
<name sortKey="Song, Yi Zhe" sort="Song, Yi Zhe" uniqKey="Song Y" first="Yi-Zhe" last="Song">Yi-Zhe Song</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000182 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000182 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:13-0324769
   |texte=   Text extraction from natural scene image: A survey
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024